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Abstract: Human coronavirus OC43 (HCoV-OC43) is one of the causes of the “common cold” in 
human during seasons of cold weather. The primary function of the HCoV-OC43 nucleocapsid 
protein (N protein) is to recognize viral genomic RNA, which leads to ribonucleocapsid formation. 
Here, we characterized the stability and identified the functional regions of the recombinant HCoV- 
OC43 N protein. Circular dichroism and fluorescence measurements revealed that the HCoV-OC43 
N protein is more highly ordered and stabler than the SARS-CoV N protein previously studied. 
Surface plasmon resonance (SPR) experiments showed that the affinity of HCoV-OC43 N protein 
for RNA was approximately fivefold higher than that of N protein for DNA. Moreover, we found that 
the HCoV-OC43 N protein contains three RNA-binding regions in its N-terminal region (residues 
1-173) and central-linker region (residues 174-232 and 233-300). The binding affinities of the 
truncated N proteins and RNA follow the order: residues 1-173-residues 233-300 > residues 174- 
232. SPR experiments demonstrated that the C-terminal region (residues 301-448) of HCoV-OC43 N 
protein lacks RNA-binding activity, while crosslinking and gel filtration analyses revealed that the 
C-terminal region is mainly involved in the oligomerization of the HCoV-OC43 N protein. This study 
may benefit the understanding of the mechanism of HCoV-OC43 nucleocapsid formation. 
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Introduction 

The strain OC43 of the type II coronavirus family 
(HC0V-OC43) was first identified in the 1960s and is 
responsible for ~20% of all “common colds” in 
humans. 1,2 Although HC0V-OC43 infections are gener¬ 
ally mild, more severe upper and lower respiratory tract 
infections like bronchiolitis and pneumonia have been 
documented, especially in infants, elderly individuals, 
and immunocompromised patients. 1,3,4 Moreover, there 
have been reports that clusters of HC0V-OC43 infec¬ 
tions cause pneumonia in otherwise healthy adults. 2,5 
Several studies have reported that both neurotropism 
and neuroinvasion of HCoV are associated with multi¬ 
ple sclerosis, especially in the case of the OC43 strain. 6 
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CoV particles have an irregular shape, which is 
determined by an outer envelope with distinctive, 
“club-shaped.” These peplomers give the virus a 
crown-like (corona) appearance. 7 The viral genome of 
coronaviruses consists of positive sense, single- 
stranded RNA of ~30 kb. The genome contains several 
genes that encode structural and nonstructural pro¬ 
teins that are required for progeny virion production. 1 
The virion envelope surrounding the nucleocapsid con¬ 
tains the following structural proteins: S (spike) pro¬ 
tein, M (membrane),and E (envelope). A third glyco¬ 
protein, HE (hemagglutinin-esterase), is present in 
most class II coronaviruses. 8,9 A helical nucleocapsid 
exists in the center of the viral particle. 10-12 Nucleo¬ 
capsid protein, the major structural protein of CoVs, 
binds to the viral RNA genome to form the virion 
core, which leads to the formation of a ribonucleopro- 
tein (RNP) complex or to a long helical nucleocapsid 
structure. 13,14 The formation of the RNP is important 
for maintaining the RNA in an ordered conformation 
suitable for replication and transcription of the viral 
genome. 13,15-17 Previous studies have shown that the 
CoV N protein is involved in the regulation of cellular 
processes such as gene transcription, actin reorganiza¬ 
tion, host cell cycle progression, and apoptosis. 18-21 It 
has also been shown to act as an RNA chaperone. 22 
Moreover, the N protein is an important diagnostic 
marker and the most “immuno-dominant” antigen in a 
host immune response. 17,23,24 

Previous studies have revealed that the N- and 
C-terminal domains of the CoV N proteins, including 
those of SARS-CoV, murine hepatitis virus (MHV), 
and avian infectious bronchitis virus (IBV), are re¬ 
sponsible for RNA binding and oligomerization, 
respectively. 25-30 The central region of the N protein 
has also been shown to contain an RNA-binding 
region and the primaiy sites of phosphoryla¬ 
tion. 28,31,32 Phosphorylation of the N protein has 
been shown to play an important role in virus biol¬ 
ogy. 33,34 To clarity the molecular mechanism of 
ribonucleocapsid formation by CoVs, structures of 
truncated fragments of N protein including N-termi- 
nal and C-terminal domains have been pub¬ 
lished. 26,35-37 Despite the conservation of some 
motifs, CoV N proteins from different strains often 
show quite different properties primarily due to their 
low sequence homology. 30 For example, Saikatendu 
et al. found that the structures of SARS-CoV and IBV 
N-terminal domains shared several common features, 
in addition to many subtle structural differences. 36 
On the basis of the different crystal packing of these 
two structures, they suggested that the two viruses 
most likely use different modes of oligomeric self¬ 
association during RNP core formation. 

The N protein of HC0V-OC43, which has a molec¬ 
ular weight of 50 kDa, is highly basic (pi, 10), and it 
shows strong hydrophilicity. 38 The N protein of HCoV- 
OC43 shows only 26-30% amino acids similarity to 


the N protein of other CoV strains. 30 Until now, few 
studies have examined the biochemical properties of 
the HC0V-OC43 N protein. In this study, we charac¬ 
terized the stability and biochemical properties of the 
recombinant full-length N protein from HC0V-OC43. 
RNA-binding and oligomerization regions of the 
HC0V-OC43 N protein were identified. 

Results 

Stability studies of recombinant HCoV-OC43 
nucleocapsid protein 

Conformational changes of the HC0V-OC43 N protein 
in response to temperatures were monitored using CD 
spectroscopy. As shown in Figure i(A), the CD spectra 
of HC0V-OC43 N protein were scanned from 190 to 
250 nm at 20, 45, 60, and 90°C. Analysis by the 
SELECON program revealed that HC0V-OC43 N 




Figure 1 . (A) CD spectra of HCoV-OC43 N protein at 
various temperatures including 20, 45, 60, and 90°C. The 
protein concentration was 5 pM and the buffer consisted of 
50 m/W Tris-HCI (pH 7.3), 150 m/W NaCI, and 0.1% CHAPS. 
(B) Thermostability measurements of HCoV-OC43 N protein 
monitored by CD spectra. The protein concentration was 
3.8 p/W and the buffer consisted of 50 m/W Tris-HCI (pH 
7.3), 150 m/W NaCI, and 0.1% CHAPS. 
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protein contains ~35-i% a-helices, n.6% p-sheets, 
23.8% turns, and 29.5% random coils at room temper¬ 
ature. At high temperatures, the CD spectra of HCoV- 
OC43 N protein showed decreased intensity between 
210 and 220 nm and a large negative peak at 190 nm, 
which are characteristic of disordered protein. Previ¬ 
ous CD spectra of the full-length N proteins from 
SARS and 229E CoVs indicated that the N protein is 
relatively disordered and composed of -.50% nonse- 
condaiy structures. 17 Our present results suggest that 
the conformations of HC0V-OC43 N proteins are more 
ordered than those of HC0V-229E and SARS-CoV N 
proteins [Fig. i(A)]. We also measured the melting 
temperature (T m ) of HC0V-OC43 N proteins using CD 
[Fig. i(B)]. Heat denaturation analysis showed that the 
T m of HC0V-OC43 N protein is 52°C under our condi¬ 
tions. The thermal denaturation of HC0-OC43 N pro¬ 
tein is reversible (data not shown). 

Denaturant reagents including urea were used to 
measure the denaturation of HC0V-OC43 N protein. 
We used tryptophan (Trp) fluorescence to monitor 
unfolding because the HC0V-OC43 N protein contains 
five Trp residues [Fig. 2(A)]. The fluorescence emis¬ 
sion spectra showed the maximal emission wavelength 
of the HC0V-OC43 N protein at ^-340 nm. Urea- 
induced unfolding caused a significant decrease in FL 
intensity at 340 nm and a red shift in the maximal 
emission wavelength. The unfolding of HC0V-OC43 N 
protein in the presence of urea was observed by moni¬ 
toring fluorescence spectra at 340 nm, and the denatu¬ 
ration curve was fitted to a two-state model [Fig. 
2(B)]. 39 The denaturation and renaturation curves 
were completely superimposable (data not shown). 
The denaturant molarity at the midpoint of the transi¬ 
tion (called C m ) was calculated as 4 M and the unfold¬ 
ing free energy change (AG°n-u) was determined from 
the linear extrapolation method to be 14.5 kJ/mol 
(Supporting Information Figure Si). 39 

Studies of nucleic acid binding by HCoV-OC43 
nucleocapsid proteins 

To measure the binding affinity between the HCoV- 
OC43 nucleocapsid protein and single-stranded nucleic 
acids, HC0V-OC43 nucleocapsid protein was allowed 
to interact with biotin-labeled single-stranded RNA or 
DNA, and the results were monitored by SPR. 
Depending on the virus strain, there are two to four 
UCUAA pentanucleotide repeats, with the last repeat 
being UCUAAAC and termed the intergenic (IG) 
sequence at the 3' end of the leader. 40,41 Previous 
studies showed that HCoV N protein has high affinity 
for the intergenic sequence. 42,43 Therefore, the 
repeated intergenic sequence of HC0V-OC43, 5'- 
(UCUAAAC)4-3', was used as a probe in our SPR and 
fluorescence experiments. For DNA, uracil was 
replaced by thymine. The results show that HCoV- 
OC43 N protein was able to interact with both DNA 
and RNA. However, HC0V-OC43 N protein showed a 
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Figure 2. (A) Fluorescence spectra of HCoV-OC43 N 
protein in Tris-HCI supplemented with different 
concentrations of urea. The protein concentration was 1 p M 
and the buffer consisted of 50 mlW Tris-HCI (pH 7.3), 

150 mM NaCI, and 0.1% CHAPS. (B) Urea-induced 
unfolding of HCoV-OC43 N protein monitored by 
fluorescence emission at 340 nm. The protein concentration 
was 1 p M and the buffer consisted of 50 mM Tris-HCI 
(pH 7.3), 150 mM NaCI, and 0.1% CHAPS. 

higher binding capacity for RNA (^4989 RU) than for 
DNA (--2803 RU) at the same protein concentration 
[Fig. 3(A)]. 

Kinetic experiments were carried out by meas¬ 
uring the binding affinity between HC0V-OC43 N pro¬ 
tein and single-stranded RNA or DNA. Four concen¬ 
trations of N protein were used to determine both the 
k a and k d . Figure 3(B) shows the Biacore SPR traces of 
N protein bound to RNA. The k a and k,\ were calcu¬ 
lated from the association and dissociation phases of 
the SPR traces, respectively. The k., and k<j of RNA 
were 4.82 x 10 4 AT 1 s _1 and 5.75 x io -4 s _1 , respec¬ 
tively. The kinetic parameters of DNA showed a lower 
k a and a higher k a of ~2.8s x 10 4 AT 1 s _1 and 1.69 x 
io -3 s“\ respectively. The dissociation constants (A (1 ) 
of HC0V-OC43 N protein with target nucleic acid 
strands were calculated as k^/k a (in M). The K { \ of the 
N protein bound to RNA was 11.9 nAf, which is close 
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Figure 3. (A) Binding capacity sensorgrams of HCoV-OC43 
N protein (0.7 p/W) bound to single-stranded RNA or single- 
stranded DNA. The RNA and DNA sequences immobilized 
on the SA chip were 5'-(UCUAAAC)4-3' and 5'- 
(TCTAAAC)4-3', respectively. (B) Sensorgrams of HCoV- 
OC43 N protein bound to single-stranded RNA. HCoV- 
OC43 N protein was exposed to RNA at concentration of 
0.7, 0.35, 0.175, and 0.0875 p/W. The buffer consisted of 50 
m/W Tris-HCI (pH 7.3) containing 150 mM NaCI and 0.1% 
CHAPS. The association rate increased with an increasing 
concentration of target N protein and the dissociation rate 
was independent of concentration. The fitted data was 
shown in Supporting Information (Figure S3). 

to the previously reported K,\ of --14 nM for the 
interaction between MHV N protein and RNA. 42 As 
expected, the for N protein and DNA was approxi¬ 
mately fivefold higher than that for N protein and 
RNA. 

Identification of the RNA-binding region of 
HCoV-OC43 nucleocapsid protein 

According to the PONDR prediction, HC0V-OC43 N 
protein contains two structural domains located at the 
N-terminal and C-terminal regions, and between them 
lies a central-linker region that is predicted to be dis¬ 
ordered [Fig 4(A)]. 30 To further isolate the RNA-bind¬ 
ing region of HC0V-OC43 N protein, we expressed and 
purified the truncated forms of the N protein: one 
construct contained only the N-terminal domain (Ni- 
173), another contained only the C-terminal domain 
(N301-448), and two contained different parts of the 
central-linker region (N174-232 and N233-300). 


These domain boundaries are based on the PONDR 
predictions [Fig. 4(A)]. The binding of each truncated 
N protein to RNA was analyzed by SPR [Fig. 4(B)]. 
These studies showed that the N-terminal domain 
(N1-173) and the central-linker region (N174-232 and 
N233-300), but not the C-terminal domain, bound to 
RNA [Fig. 4(B)]. The kinetic constants for association 
(k a in M 1 s ') and dissociation (k,\ in s ') of the trun¬ 
cated N proteins with respect to singled-stranded RNA 
have been calculated and listed in Table I. The k a val¬ 
ues were essentially the same: 2.18 x 10 4 and 2.48 x 
10 4 M 1 s ' for N1-173 and N233-300, respectively, 
and 7.97 x to 3 AT 1 s _1 for N174-232. Moreover, the 
/c (i values of N1-173 and N233-300 were essentially 
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Figure 4. (A) Order/disorder prediction for HCoV-OC43 N 
protein using the VL3-BA predictor of the PONDR program. 
(B) Sensorgrams of the truncated forms of HCoV-OC43 N 
protein including N1-173, N174-232, N233-300, and N301- 
448 bound to single-stranded RNA. Protein was tested at 
concentrations of 2 p/W in buffer containing 50 m M Tris-HCi 
(pH 7.3), 150 m M NaCI, and 0.1% CHAPS. The RNA 
immobilized on the SA chip was 5'-(UCUAAAC)4-3'. In 
N233-300, bulk effects caused by the differences in the 
refractive index of the running buffer and sample solution 
caused a sharp descent in the curves from 140 s to 142 s; 
hence, the dissociation measurements were started at 
142 s (marked by the arrow). 
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Table I. The Kinetic Parameters of the Binding of Full-Length and Truncated N Proteins 
to Single-Stranded RNA, 5 '-(UCUAAAQ 4 - 3 1 


Construct 

k a (IkT 1 s- 1 ) 

k A (s- 1 ) 


a X(M) 

Full-length 

4.82 ± 0.21 X to 4 

5.75 ± 0.23 X 

10- 4 

11.9 ± 1.0 X 10~ 9 

Nl-173 

2.18 ± 0.34 X to 4 

1.5 ± 0.21 X 

10~ 3 

68.9 ± 8.4 X 10~ 9 

N174-232 

7.97 ± 0.62 x to 3 

3-53 ± 0.32 x 

10~ 3 

44.4 ± 3.7 X 10 3 

N233-300 

2.48 ± 0.28 X 10 4 

1.58 ± 0.24 X 

10~ 3 

64.1 ± 7.2 X 10~ 9 

N301-448 

b_ 

— 


— 


a K d values were obtained by dividing fc d by fc a . 
b No significant binding was observed. 


the same: 1.50 x io~ 3 and 1.58 x io~ 3 s _1 , respec¬ 
tively. These constants were smaller than that of 
N174-232 (3.53 x 10 3 s ')■ The dissociation con¬ 
stants (/Q) of all truncated N proteins with RNA were 
calculated as k^/k a (in M) and they follow the order: 
N174-232 > N1-173-N233-300. Our results suggest 
that the N protein of HC0V-OC43 contains three RNA- 
binding regions. 

Oligomerization of the HCoV-OC43 
nucleocapsid protein 

The self-association characters of SARS-CoV and 
HC0V-229E N proteins has been reported previ¬ 
ously. 17,44 We further characterized the oligomerization 
of full-length HC0V-OC43 N proteins using crosslink¬ 
ing assays. As shown in Figure 5(A), HC0V-OC43 N 
protein was able to form dimers, trimers, tetramers, 
and higher-order multimers as the glutaraldehyde con¬ 
centrations was increased. To evaluate whether elec¬ 
trostatic interactions stabilize the compact conforma¬ 
tions of higher-order multimers, the oligomeric 
behaviors of N protein at different ionic strengths 
were studied. Briefly, the purified full-length N 
proteins, which were in a buffer containing sodium 
chloride, were cross-linked by 0.04% glutaraldehyde. 
In addition, we analyzed the oligomerization of N pro¬ 
tein using analytical gel filtration chromatography 
[Fig. 5(B)]. N protein eluted with a retention volume 
of —49.5 mL, corresponding to an apparent molecular 
weight of 621 kDa, based on the standard curve. This 
implies an oligomer containing ~12 molecules. 

Identification of the oligomerization region of 
HCoV-OC43 nucleocapsid protein 

To determine the regions that are of greatest impor¬ 
tance for HC0V-OC43 N protein oligomerization, sev¬ 
eral regions of N protein were purified and tested 
using crosslinking assays and analytical gel filtration. 
As shown in Figure 6(A), crosslinking studies of 
N233-448 detected dimers, tetramers, and oligomers. 
A shorter form of the protein, N301-448, also formed 
dimers, trimers, and oligomers [Fig. 6(B)], Gel filtra¬ 
tion analysis showed a peak of N301-448 at high 
molecular weight (225 kDa), consistent with a ~12- 
mer [Fig. 6(C)]. The crosslinking assay was also con¬ 
ducted to investigate oligomerization of the N-terminal 


(N1-173) and middle segment (N174-232) of N pro¬ 
tein. Both truncated N proteins remained as mono¬ 
mers under these experimental conditions [Fig. 7], 
suggesting that these regions are not involved in the 
oligomerization. Our data showed that the 148 amino 
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Figure 5. (A) Crosslinking assay of full-length N protein in 
the presence of various concentrations of glutaraldehyde. 
The concentration of the protein was 2 \iM. (B) Gel filtration 
chromatography-based analysis (Superdex 300 XK16/70) of 
N protein dissolved in 50 mM Tris-HCI (pH 7.3), 150 mM 
NaCI, and 0.1 % CHAPS. The concentration of the target 
protein was 3 y\M. Standard proteins of 669 kDa, 158 kDa, 
67 kDa, and 43 kDa are denoted as 1, 2, 3, and 4, 
respectively. 
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Figure 6. (A) Crosslinking assay of N233-448 in the 
presence of glutaraldehyde at various concentrations. The 
concentration of target protein was 2 yiM. (B) Crosslinking 
assay of N301-448 in the presence of the indicated 
concentrations of glutaraldehyde. The concentration of 
target protein was 3 p/W. (C) Gel filtration chromatography 
based analysis (Superdex 200 XK16/70) of N301-448 
buffered by 50 mM Tris-HCI with 150 mM NaCI and 0.1% 
CHAPS at pH 7.3. The concentration of target protein was 
5 p/W. The protein markers of 669 kDa, 223 kDa, 67 kDa, 
and 43 kDa are denoted as 1, 2, 3, and 4, respectively. 


acids at the C-terminus were the oligomerization do¬ 
main of HC0V-OC43 N protein. 


Discussion 

We have previously analyzed the full-length SARS-CoV 
and HC0V-229E N proteins by CD and found them to 
be relatively disordered, showing nearly 50% disor¬ 
dered structures. 17 Here, we showed that the HCoV- 
OC43 N protein possesses a more ordered structure 
than other human coronavirus N proteins, including 
those of SARS and 229E, which feature 35% a-helix 
content (Supporting Information Figure S2). Unlike 
SARS-CoV and HC0V-229E N proteins, the recombi¬ 
nant HC0V-OC43 N protein is highly resistant to pro¬ 
teolysis. 17 According to previous heat and urea 




M 0 0.017 0.050 0.083 0.133 


Monomer 

Figure 7. (A) Crosslinking assay of N1-173 in the presence 
glutaraldehyde at various concentrations. The concentration 
of target protein was 2 p/W. (B) Crosslinking assay of 
N174-232 in the presence of the indicated concentrations 
of glutaraldehyde. The concentration of target protein 
was 1.5 p/W. 
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denaturation analyses, the 7 ’ m and C m values of SARS- 
CoV N protein are 38°C and 2.77 M, respectively, 
implying a low stability of the SARS-CoV N protein. 45 
In this work, the 7 ' m and C m of the HC0V-OC43 N pro¬ 
tein were found to be 52°C and 4 M, respectively. Our 
results suggest that the HC0V-OC43 N protein is more 
stable than the SARS-CoV N protein. 45 

In this study, we have shown that the recombi¬ 
nant HC0V-OC43 N protein expressed in E. coli binds 
to the intergenic sequence of the RNA genomic leader. 
This binding demonstrates that the obtained recombi¬ 
nant N protein was properly folded. We examined the 
binding of recombinant HC0V-OC43 protein for 
nucleic acids. Our kinetic data showed that HCoV- 
OC43 N protein binds preferentially to single-stranded 
RNA over the corresponding single-stranded DNA. 
The decreased affinity for DNA may be due to two fac¬ 
tors. First, the 2'-hydroxyl group of ribose in RNA 
may form hydrogen bonds with residues of the N pro¬ 
tein. Second, in DNA, the methyl group on the C5 of 
thymine may create a steric clash with residues within 
the RNA-binding groove. The Hiscox group suggested 
that phosphorylation of the N protein determined the 
recognition of virus RNA because phosphorylation 
may alter the conformation of N protein, and thus 
affect its RNA-binding sites. 12,33,34 In addition, 
Mohandas et al. have reported that dephosphorylation 
of MHV N protein by cellular phosphoprotein phos¬ 
phatase may facilitate the infectious process. 46 How¬ 
ever, the role of phosphorylation in the RNA-binding 
sites of HC0V-OC43 N protein was not investigated in 
our study because the N protein was expressed in E 
coli. It would be interesting to study the effects of 
phosphorylation on the conformation and RNA-bind¬ 
ing properties of HC0V-OC43 N protein in the future. 

Previous studies have reported that the N-termi- 
nus of SARS-CoV N protein provides a scaffold for 
RNA binding. 29,36,47 X-ray analysis revealed that the 
fold of the N-terminal domain of N protein is essen¬ 
tially conserved across the various CoV groups. It has 
a U-shaped structure with the two arms rich in basic 
residues and the flexible loops well-ordered around 
the ( 1 -sheet core of the N-terminal domain. 26,36 
Spencer and Hiscox found that the N-terminal region 
of IBV protein facilitates long-range, nonspecific inter¬ 
actions between N protein and viral RNA, thus leading 
to the formation of the ribonucleocapsid via a lure and 
lock mechanism. 29 Similar to the N-terminal domain 
of SARS-CoV and IBV N proteins, the N-terminal do¬ 
main of HC0V-OC43 N protein contains many posi¬ 
tively charged residues and has been shown by SPR to 
be responsible for RNA binding. In addition to the N- 
terminal domain (N1-173), two highly positively 
charged portions of the central linker region, N174- 
232 and N233-300, also show RNA-binding activity. 
Given their high pi values, these RNA-binding regions 
may rely on electrostatic interactions to interact with 
RNA. Compared with N1-173 and N233-300, N174- 


232 has lower RNA-binding affinity and, therefore, 
appears to play an auxiliary role in RNA binding, de¬ 
spite the presence of a serine/arginine motif (SR-rich 
motif) that is usually involved in RNA recognition. 48 
In a previous study, the SR-rich motif of MHV N pro¬ 
tein was shown to be the RNA-binding region. 28,42 In 
addition, the C-terminal domain of SARS-CoV N pro¬ 
tein has been proposed to bind nucleic acids and to 
participate in genomic RNA-binding. 25,49 However, 
our studies suggest that the C-terminal domain 
(N301-448) of the HC0V-OC43 N protein does not 
bind RNA. 

A previous study showed that SARS-CoV N pro¬ 
tein forms high-order oligomers through the action of 
its C-terminal 138 amino acids. These oligomers exist 
predominantly as dimers. 44,50 The N-terminal domain 
allows for the association of dimers to form higher- 
order oligomers. Crystal structures of the C-terminal 
domain of SARS-CoV and IBV N proteins show a simi¬ 
lar general polypeptide fold, which strongly suggests 
that the dimerized N protein is the functional unit in 
vivo for the four groups of coronaviruses. 27,35 The 
crystal structure of the C-terminal domain shows a 
tightly intertwined, twofold symmetric C-terminal do¬ 
main dimer with a |i-hairpin (f)i and (S2) from one 
subunit extending into the cavity of the opposite subu¬ 
nit; this forms an antiparallel (1-sheet with hydrogen 
bonds across the dimer interface. Luo et al . 49 have 
reported that this dimer association does not involve 
electrostatic forces. These authors have further sug¬ 
gested that the SR-rich motif of the SARS-CoV N pro¬ 
tein binds to the central region and thereby triggers 
the multimerization of dimers. 51 In addition, Fan et al. 
showed that the N-terminal region of the IBV N pro¬ 
tein is involved in the oligomerization of N protein. 26 
Here, we analyzed the self-association properties of 
the HC0V-OC43 N protein and found it to form 
oligomers. We identified the C-terminal 148 amino 
acids of HC0V-OC43 N protein to be the oligomeriza¬ 
tion domain. When expressed in isolation, this region 
of the C-terminus formed oligomers at only micromo¬ 
lar concentrations, and this oligomerization occurred 
in the absence of the N-terminal domain. The other 
truncated constructs of HC0V-OC43 N protein, Ni- 
173 and N174-232, were found to remain as mono¬ 
mers under all of the tested experimental conditions, 
suggesting that these two regions are not involved in 
oligomerization during ribonucleocapsid formation. 

In summary, we found the HC0V-OC43 N protein 
exhibited specific properties compared with other 
CoVs. For example, HC0V-OC43 N protein is more 
highly ordered and stabler than the SARS-CoV protein 
previously studied. 17,45 We also showed that HCoV- 
OC43 N protein is composed of three RNA-binding 
regions lying within the N-terminal region (residues 
1-173) and central-linker region (residues 174-232 
and 233-300) that bind to RNA in a cooperative man¬ 
ner. Although the C-terminal region (N301-448) lacks 
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RNA-binding activity, this region plays an important 
role in the oligomerization of the HC0V-OC43 N pro¬ 
tein. This study may benefit the development of drugs 
to disrupt the binding of viral N protein with RNA 
and viral assembly. 

Materials and Methods 

Drugs and reagents were purchased from Sigma 
Chemical Co. All oligoribonucleotides and oligodeoxyr- 
ibonucleotides were synthesized using an automated 
DNA synthesizer and purified by gel electrophoresis. 
The biotin-linked oligomers were synthesized by incor¬ 
porating the biotin synthon at the g'-end of the 
oligomers; oligomers were then immobilized to the 
streptavidin-coated biosensor chip used for the surface 
plasmon resonance (SPR) experiments. 

Expression and purification of the full-length 
and truncated N proteins 

The templates for the HC0V-OC43 N protein were pro¬ 
vided by the Institute of Biological Chemistry, Aca¬ 
demia Sinica (Taipei, Taiwan). To generate both full- 
length and truncated forms of the recombinant N pro¬ 
teins, the N protein gene was amplified by polymerase 
chain reaction (PCR) from plasmid pGENT using vari¬ 
ous primers. The PCR products were digested with 
Ndel and Xhol, and the DNA fragments were cloned 
into pET28a (Novagen) using T4 ligase (NEB). Bacte¬ 
ria transformed with the resultant plasmid were grown 
in culture. Protein expression was induced by supple¬ 
menting the culture medium with IPTG to 1 m M fol¬ 
lowed by incubation at io°C for 24 h. After harvesting 
the bacteria by centrifugation (35005, 30 min, 4°C), 
the bacterial pellets were lysed with lysis buffer (50 
m M Tris-buffered solution, pH 7.3, 150 m M NaCl, 
0.1% CHAPS, and 15 m M imidazole). Soluble proteins 
were obtained from the supernatant following centrifu¬ 
gation (15,000 rpm, 30 min, 4°C) to remove the pre¬ 
cipitate. Full-length and truncated N proteins carrying 
a His6-tag at their N-termini were purified using a Ni- 
NTA column (Novagen) with an elution gradient rang¬ 
ing from 15 to 300 m M imidazole. The pure fractions 
were collected and dialyzed against low-salt buffer. 
Since the N protein is positively-charged, it was fur¬ 
ther purified by SP cation exchange chromatography 
using a gradient from 0.05 M to 1.5 M NaCl in 50 mM 
Tris at pH 7.3, 0.1% CHAPS. The protein concentra¬ 
tions were determined using the Bradford method 
with Bio-Rad protein assay reagents. 

Circular dichroism spectroscopy 

Circular dichroism (CD) spectra were obtained using a 
JASCO-815 CD spectropolarimeter. Temperature was 
controlled by circulating water at the desired tempera¬ 
ture in the cell jacket. Each protein were dissolved in 
50 m M Tris-buffered solution, pH 7.3, 150 m M NaCl, 
and 0.1% CHAPS. The CD spectra were collected 
between 250 and 190 nm with 1 nm bandwidth at 


1 nm intervals. All spectra were obtained from an aver¬ 
age of five scans. The photomultiplier absorbance did 
not exceed 600 V during the analysis. CD spectra were 
normalized by subtraction of the background scan 
with buffer alone. The mean residue ellipticity, [ 0 ], 
was calculated based on the equation, [0] = 

MRW x 0 )./1 o x 1 x e, where MRW is the mean residue 
weight, 0>. is the measured ellipticity in milidegrees at 
wavelength X, l is the cuvette pathlength (0.1 cm), and 
c is the protein concentration in g/mL. The results 
were analyzed using the SELCON3 program to calcu¬ 
late the percentage of each type of secondary struc¬ 
ture. 52 In addition, T m was determined from the poly¬ 
nomial fitting of the observed curve and taken as the 
temperature corresponding to half denaturation of the 
N protein. The first derivative of absorption with 
respect to temperature, cL 4 /d 7 ', of the melting curve 
was computer-generated and used for determining 
the T m . 

Fluorescence spectroscopy 

In the urea-induced unfolding experiment, N protein 
was added to a final concentration of 1 pM to buffer 
containing various concentrations of urea, and the 
samples were incubated at 25°C for 30 min. The buffer 
consisted of 50 m M Tris, pH 7.3, and 0.1% CHAPS. 
Tryptophan fluorescence measurements were made 
using an F-4500 fluorescence spectrophotometer 
(Hitachi) equipped with a cuvette with a 1 cm light 
path. The excitation wavelength was 288 nm, and 
the emission data were collected between 300 and 
400 nm. 

Surface plasmon resonance binding 
experiments 

The affinity, association, and dissociation of N proteins 
and RNA were measured in a BIAcore 3000A SPR 
instrument (Pharmacia, Uppsala, Sweden) equipped 
with a SensorChip SA5 from Pharmacia; the apparatus 
measured binding by monitoring the refractive index 
change of the sensor chip surface. These changes, 
recorded in resonance units (RU), are generally 
assumed to be proportional to the mass of the mole¬ 
cules bound to the chip. The surface was first washed 
three times by injecting 10 pL of too m M NaCl solu¬ 
tion with 50 m M NaOH. To control the amount of 
RNA (or DNA) bound to the SA chip surface, the bio¬ 
tinylated oligomer was immobilized manually onto the 
surface of a streptavidin chip until a signal of 1200 RU 
was achieved in the first cell. The chip surface was 
then washed with 10 pL of 10 mM HC 1 to eliminate 
non-specific binding. The second flow cell was 
unmodified and served as a control. The appropriate N 
proteins were dissolved in 50 m M Tris (pH 7.3) with 
150 m M NaCl and 0.1% CHAPS, and passed over the 
chip surface for 140 s at a flow rate of 30 pL/min to 
reach equilibrium. Blank buffer solution was then 
passed over the chip to initiate the dissociation 
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reaction; this step was continued for an additional 
600 s to complete the reaction. After 600 s, the sur¬ 
face was recovered by washing with 10 pL of 0.1% SDS 
for each single-stranded RNA (or DNA). Before fitting 
to the 1:1 Langmuir model, binding data were cor¬ 
rected by subtraction of the control to account for 
simple refractive index differences. Sensorgrams for 
interactions between RNA (or DNA) and protein were 
analyzed using BIA evaluation software (version 3). To 
verify if mass transport effects may arise during the 
interaction between N protein and immobilized RNA, 
binding experiments were performed at different flow 
rates and different surface binding capacity. It was 
found that the binding interaction between N protein 
and RNA is not a mass transport-limited process 
because the derived kinetic parameters (especially the 
association rate constant, k R ) are independent of the 
flow rate and binding capacity. 

Chemical crosslinking assay 

To investigate the oligomerization features of N pro¬ 
teins, a chemical crosslinking experiment was per¬ 
formed. A series of protein solutions containing N pro¬ 
teins was supplemented with various concentrations of 
glutaraldehyde, and each reaction mixture was incu¬ 
bated at room temperature for 5 min. The reaction 
was stopped by the addition of 1 M Tris. pH 7.3 (0.5%, 
v/v, final concentration) and incubation on ice. The 
sample solution was then analyzed by SDS-PAGE. 

Analytical gel filtration chromatography 

Gel filtration experiments were performed using fast 
protein liquid chromatography (Amersham Bioscien¬ 
ces) equilibrated with buffer containing 50 m M Tris- 
HC 1 (pH 7.3), 150 m M NaCl, and 0.1% CHAPS at a 
flow rate of 0.5 mL/min. Blue dextran was used to 
determine of the void volume (Vo). Several proteins of 
known molecular weight were used as standards (thy- 
roglobulin, 669 kDa; catalase, 232 kDa; aldolase, 158 
kDa; albumin, 67 kDa; ovalbumin, 43 kDa; chymo- 
tiypsinogen A, 25 kDa and ribonuclease A, 13.7 kDa, 
purchased from GE Healthcare), and their elution vol¬ 
umes (V e ) were determined. The standard curve was 
plotted as the logarithm of the molecular weight 
against V e /V 0 for the standard proteins. 

Acknowledgment 

We thank Dr. Tai-Huang Huang (IBS, Academia Sinica) 
for his helpful discussions and guidance. This manu¬ 
script was edited by American Journal Expert (AJE). 

References 

1. St-Jean JR, Jacomy H, Desforges M, Vabret A, Freymuth 
F, Talbot PJ (2004) Human respiratory coronavirus 
OC43: genetic stability and neuroinvasion. J Virol 78: 
8824-8834. 


2. Vabret A, Mourez T, Gouarin S, Petitjean J, Freymuth F 
(2003) An outbreak of coronavirus OC43 respiratory 
infection in Normandy. Fr Clin Infect Dis 36:985-989. 

3. El-Sahly HM, Atmar RL, Glezen WP, Greenberg SB 
(2000) Spectrum of clinical illness in hospitalized 
patients with “common cold” virus infections. Clin Infect 
Dis 31:96-100. 

4 . Gagneur A, Sizun J, Vallet S, Legr MC, Picard B, Talbot 
PJ (2002) Coronavirus-related nosocomial viral respira¬ 
tory infections in a neonatal and paediatric intensive care 
unit: a prospective study. J Hosp Infect 51:59-64. 

5. Wenzel RP, Hendley JO, Davies JA, Gwaltney JM, Jr 
(1974) Coronavirus infections in military recruits. Three- 
year study with coronavirus strains OC43 and 229E. Am 
Rev Respir Dis 109:621-624. 

6. Arbour N, Day R, Newcombe J, Talbot PJ (2000) Neuro¬ 
invasion by human respiratory coronaviruses. J Virol 74: 
8913-8921. 

7. Leng Q, Bentwich Z (2003) A novel coronavirus and 
SARS. N Engl J Med 349:709. 

8. Hogue BG, Brian DA (1986) Structural proteins of 
human respiratory coronavirus OC43. Virus Res 5: 
131-144. 

9. Lai MM, Cavanagh D (1997) The molecular biology of 
coronaviruses. Adv Virus Res 48:1-100. 

10. Hiscox JA (2007) RNA viruses: hijacking the dynamic 
nucleolus. Nat Rev Microbiol 5:119-127. 

11. You JH, Reed ML, Dove BK, Hiscox JA (2006) Three- 
dimensional reconstruction of the nucleolus using meta- 
confocal microscopy in cells expressing the coronavirus 
nucleoprotein. Adv Exp Med Biol 581:313-318. 

12. Stohlman SA, Fleming JO, Patton CD, Lai MM (1983) 
Synthesis and subcellular localization of the murine coro¬ 
navirus nucleocapsid protein. Virology 130:527-532. 

13. Masters PS, Parker MM, Ricard CS, Duchala C, Frana 
MF, Holmes KV, Sturman LS (1990) Structure and func¬ 
tion studies of the nucleocapsid protein of mouse hepati¬ 
tis virus. Adv Exp Med Biol 276:239-246. 

14. Masters PS, Sturman LS (1990) Background paper. Func¬ 
tions of the coronavirus nucleocapsid protein. Adv Exp 
Med Biol 276:235-238. 

15. Pyre K, Jebbink MF, Berkhout B, Van Der Hoek L (2004) 
Genome structure and transcriptional regulation of 
human coronavirus NL63. Virol J 1:7. 

16. Baric RS, Nelson GW, Fleming JO, Deans RJ, Keck JG, 
Casteel N, Stohlman SA (1988) Interactions between co¬ 
ronavirus nucleocapsid protein and viral RNAs: implica¬ 
tions for viral transcription. J Virol 62:4280-4287. 

17. Tang TK, Wu MP, Chen ST, Hou MH, Hong MH, Pan 
FM, Yu HM, Chen JH, Yao CW, Wang AH (2005) Bio¬ 
chemical and immunological studies of nucleocapsid pro¬ 
teins of severe acute respiratory syndrome and 229E 
human coronaviruses. Proteomics 5:925-937. 

18. Hsieh PK, Chang SC, Huang CC, Lee TT, Hsiao CW, Kou 
YH, Chen IY, Chang CK, Huang TH, Chang MF (2005) 
Assembly of severe acute respiratory syndrome coronavi¬ 
rus RNA packaging signal into virus-like particles is nu¬ 
cleocapsid dependent. J Virol 79:13848-13855. 

19. Kopecky-Bromberg SA, Martinez-Sobrido L, Frieman M, 
Baric RA, Palese P (2007) Severe acute respiratory syn¬ 
drome coronavirus open reading frame (ORF) 3b, ORF 6, 
and nucleocapsid proteins function as interferon antago¬ 
nists. J Virol 81:548-557. 

20. Du L, Zhao G, Lin Y, Chan C, He Y, Jiang S, Wu C, Jin 
DY, Yuen KY, Zhou Y, Zheng BJ (2008) Priming with 
rAAV encoding RBD of SARS-CoV S protein and boosting 
with RBD-specific peptides for T cell epitopes elevated 
humoral and cellular immune responses against SARS- 
CoV infection. Vaccine 26:1644-1651. 


Huang et al. 


VOL 18:2209-2218 2217 


21. Surjit M, Liu B, Chow VT, Lai SK (2006) The nucleocap- 
sid protein of severe acute respiratory syndrome-corona- 
virus inhibits the activity of cyclin-cyclin-dependent 
kinase complex and blocks S phase progression in mam¬ 
malian cells. J Biol Chem 281:10669-10681. 

22. Zuniga S, Sola I, Moreno JL, Sabella P, Plana-Duran J, 
Enjuanes L (2007) Coronavirus nucleocapsid protein is 
an RNA chaperone. Virology 357:215-227. 

23. Chan KH, Cheng VC, Woo PC, Lau SK, Poon LL, Guan Y, 
Seto WH, Yuen KY, Peiris JS (2005) Serological 
responses in patients with severe acute respiratory syn¬ 
drome coronavirus infection and cross-reactivity with 
human coronaviruses 229E, OC43, and NL63. Clin Diagn 
Lab Immunol 12:1317-1321. 

24. Mourez T, Vabret A, Han Y, Dina J, Legrand L, Corbet S, 
Freymuth F (2007) Baculovirus expression of HCoV- 
OC43 nucleocapsid protein and development of a West¬ 
ern blot assay for detection of human antibodies against 
HC0V-OC43. J Virol Methods 139:175-180. 

25. Chen CY, Chang CK, Chang YW, Sue SC, Bai HI, Riang 
L, Hsiao CD, Huang TH (2007) Structure of the SARS co¬ 
ronavirus nucleocapsid protein RNA-binding dimerization 
domain suggests a mechanism for helical packaging of vi¬ 
ral RNA. J Mol Biol 368:1075-1086. 

26. Fan H, Ooi A, Tan YW, Wang S, Fang S, Liu DX, Lescar J 
(2005) The nucleocapsid protein of coronavirus infectious 
bronchitis virus: crystal structure of its N-terminal domain 
and multimerization properties. Structure 13:1859-1868. 

27. Jayaram H, Fan H, Bowman BR, Ooi A, Jayaram J, Col- 
lisson EW, Lescar J, Prasad BV (2006) X-ray structures 
of the N- and C-terminal domains of a coronavirus nu¬ 
cleocapsid protein: implications for nucleocapsid forma¬ 
tion. J Virol 80:6612-6620. 

28. Nelson GW, Stohlman SA (1993) Localization of the 
RNA-binding domain of mouse hepatitis virus nucleocap¬ 
sid protein. J Gen Virol 74 (Part 9):l975-l979- 

29. Spencer KA, Hiscox JA (2006) Characterisation of the 
RNA binding properties of the coronavirus infectious 
bronchitis virus nucleocapsid protein amino-terminal 
region. FEBS Lett 580:5993-5998. 

30. Chang CK, Sue SC, Yu TH, Hsieh CM, Tsai CK, Chiang 
YC, Lee SJ, Hsiao HH, Wu WJ, Chang WL, Lin CH, 
Huang TH (2006) Modular organization of SARS corona¬ 
virus nucleocapsid protein. J Biomed Sci 13:59-72. 

31. Masters PS (1992) Localization of an RNA-binding do¬ 
main in the nucleocapsid protein of the coronavirus 
mouse hepatitis virus. Arch Virol 125:141-160. 

32. Peng D, Koetzner CA, Mcmahon T, Zhu Y, Masters PS 
(1995) Construction of murine coronavirus mutants con¬ 
taining interspecies chimeric nucleocapsid proteins. J 
Virol 69:5475-5484. 

33. Spencer KA, Dee M, Britton P, Hiscox JA (2008) Role of 
phosphorylation clusters in the biology of the coronavirus 
infectious bronchitis virus nucleocapsid protein. Virology 
370:373-381. 

34. Chen H, Gill A, Dove BK, Emmett SR, Kemp CF, Ritchie 
MA, Dee M, Hiscox JA (2005) Mass spectroscopic char¬ 
acterization of the coronavirus infectious bronchitis virus 
nucleoprotein and elucidation of the role of phosphoryla¬ 
tion in RNA binding by using surface plasmon resonance. 
J Virol 79:1164-1179. 

35. Yu IM, Oldham ML, Zhang J, Chen J (2006) Crystal 
structure of the severe acute respiratory syndrome 
(SARS) coronavirus nucleocapsid protein dimerization 
domain reveals evolutionary linkage between corona- and 
arteriviridae. J Biol Chem 281:17134-17139. 


36. Saikatendu KS, Joseph JS, Subramanian V, Neuman BW, 
Buchmeier MJ, Stevens RC, Kuhn P (2007) Ribonucleo- 
capsid formation of severe acute respiratory syndrome 
coronavirus through molecular action of the N-terminal 
domain of N protein. J Virol 81:3913-3921. 

37. Takeda M, Chang CK, Ikeya T, Guntert P, Chang YH, 
Hsu YL, Huang TH, Kainosho M (2008) Solution struc¬ 
ture of the c-terminal dimerization domain of SARS coro¬ 
navirus nucleocapsid protein solved by the SAIL-NMR 
method. J Mol Biol 380:608-622. 

38. Kamahora T, Soe LH, Lai MM (1989) Sequence analysis 
of nucleocapsid gene and leader RNA of human coronavi¬ 
rus OC43. Virus Res 12:1-9. 

39. Bolen DW, Santoro MM (1988) Unfolding free energy 
changes determined by the linear extrapolation method. 
II. Incorporation of delta G degrees N-U values in a ther¬ 
modynamic cycle. Biochemistry 27:8069-8074. 

40. Wang Y, Zhang X (2000) The leader RNA of coronavirus 
mouse hepatitis virus contains an enhancer-like element 
for subgenomic mRNA transcription. J Virol 74: 
10571-10580. 

41. Sawicki SG, Sawicki DL, Siddell SG (2007) A contempo¬ 
rary view of coronavirus transcription. J Virol 81:20-29. 

42. Nelson GW, Stohlman SA, Tahara SM (2000) High affin¬ 
ity interaction between nucleocapsid protein and leader/ 
intergenic sequence of mouse hepatitis virus RNA. J Gen 
Virol 81:181-188. 

43. Welter LM, Stohlman SA, Deans RJ (1990) MHV leader 
RNA secondary structure affects binding to the nucleo¬ 
capsid protein. Adv Exp Med Biol 276:247-254. 

44. Yu IM, Gustafson CL, Diao J, Burgner JW, Li Z, Zhang J, 
Chen J (2005) Recombinant severe acute respiratory syn¬ 
drome (SARS) coronavirus nucleocapsid protein forms a 
dimer through its C-terminal domain. J Biol Chem 280: 
23280-23286. 

45. Wang Y, Wu X, Wang Y, Li B, Zhou FI, Yuan G, Fu Y, 
Luo Y (2004) Low stability of nucleocapsid protein in 
SARS virus. Biochemistry 43:11103-11108. 

46. Mohandas DV, Dales S (1991) Endosomal association of a 
protein phosphatase with high dephosphorylating activity 
against a coronavirus nucleocapsid protein. FEBS Lett 
282:419-424. 

47. Huang Q, Yu L, Petros AM, Gunasekera A, Liu Z, Xu N, 
Hajduk P, Mack J, Fesik SW, Olejniczak ET (2004) 
Structure of the N-terminal RNA-binding domain of the 
SARS CoV nucleocapsid protein. Biochemistry 43: 
6059-6063. 

48. Lin S, Fu XD (2007) SR proteins and related factors in 
alternative splicing. Adv Exp Med Biol 623:107-122. 

49. Luo H, Chen J, Chen K, Shen X, Jiang H (2006) Car¬ 
boxyl terminus of severe acute respiratory syndrome co¬ 
ronavirus nucleocapsid protein: self-association analysis 
and nucleic acid binding characterization. Biochemistry 
45:11827-11835. 

50. Chang CK, Sue SC, Yu TH, Hsieh CM, Tsai CK, Chiang 
YC, Lee SJ, Hsiao HH, Wu WJ, Chang CF, Huang TH 
(2005) The dimer interface of the SARS coronavirus nu¬ 
cleocapsid protein adapts a porcine respiratory and 
reproductive syndrome virus-like structure. FEBS Lett 
579:5663-5668. 

51. Luo H, Ye F, Chen K, Shen X, Jiang FI (2005) SR-rich 
motif plays a pivotal role in recombinant SARS coronavi¬ 
rus nucleocapsid protein multimerization. Biochemistry 
44:15351-15358. 

52. Sreerama N, Woody RW (1993) A self-consistent method 
for the analysis of protein secondary structure from circu¬ 
lar dichroism. Anal Biochem 209:32-44. 


2218 PROTEINSCIENCE.ORG 


Human Coronavirus OC43 Nucleocapsid Protein 


